Human Action Recognition and Shape Segmentation-Recognition

نویسنده

  • Jianbo Shi
چکیده

Human Action Recognition. Human action recognition has broad range of applications such as video search, sports analysis, human robotics interactions, and health care. Our work is organized in two directions: 1) detailed pixel-level ‘motion and pose’, focusing on close interactions among people; 2) action recognition focusing on goal oriented motion, simplified as ‘action = motion + intention’. In “Detecting Unusual Activity in Video” (cvpr2004), we demonstrated that using large amount of un-labeled video data and a robust graph co-clustering approach, one can uncover visual patterns of un-usual and usual actions. This was an exciting discovery, as it suggested that big-data can solve this hard vision problem without explicitly defining action categories, and without detailed analysis of human motion. Through more experiments, it was clear that such big-data approach has an ‘autistic’ limitation: it memorizes many details, but understands little intricate relationships of human motion and causality among them. It has little ability to make long-range prediction of future actions. My recent work on human action recognition is aimed to resolve this ‘autistic’ limitation. “Action = motion + intention”. Intention as And-Or graph of Actor-Actions: Storyline model. In “Understanding Videos, Constructing Plots: Learning a Visually Grounded Storyline Model from Annotated Video” (cvpr2009), we studied causality among actions. Our Storyline model can be regarded as a stochastic spatio-temporal grammar, whose language (individual storylines) represents potential plausible explanations of new videos in a domain. The basic insight is that all the variations of an event share a goal directed sequence of actions (akin to how movies of the same genre has similar flow of story subplots). The variations, such as one falls down as he runs towards a car, are due to different effects of each action. Our method requires only short video segments accompanied by a text description of the actors and actions present in the video. The system requires no detail annotations of the actor and actions in the video. We model the storyline grammar as a probabilistic AND-OR graph. The Storyline inference

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Human action segmentation and recognition via motion and shape analysis

In this paper, we present an automated video analysis system which addresses segmentation and detection of human actions in an indoor environment, such as a gym. The system aims at segmenting different movements from the input video and recognizing the action types simultaneously. Two action segmentation techniques, namely color intensity based and motion based, are proposed. Both methods can e...

متن کامل

Evaluation of the Parameters Involved in the Iris Recognition System

Biometric recognition is an automatic identification method which is based on unique features or characteristics possessed by human beings and Iris recognition has proved itself as one of the most reliable biometric methods available owing to the accuracy provided by its unique epigenetic patterns. The main steps in any iris recognition system are image acquisition, iris segmentation, iris norm...

متن کامل

A New IRIS Segmentation Method Based on Sparse Representation

Iris recognition is one of the most reliable methods for identification. In general, itconsists of image acquisition, iris segmentation, feature extraction and matching. Among them, iris segmentation has an important role on the performance of any iris recognition system. Eyes nonlinear movement, occlusion, and specular reflection are main challenges for any iris segmentation method. In thi...

متن کامل

A New IRIS Segmentation Method Based on Sparse Representation

Iris recognition is one of the most reliable methods for identification. In general, itconsists of image acquisition, iris segmentation, feature extraction and matching. Among them, iris segmentation has an important role on the performance of any iris recognition system. Eyes nonlinear movement, occlusion, and specular reflection are main challenges for any iris segmentation method. In thi...

متن کامل

Training Set of Data Bin for Small Black Pixels Neighborhood Recognition of Each Boundary

We first describe how to “fuzzify” the estimated binary columns to create a [0,1]-valued column. Werefer to this [0,1] -valued column as the soft segmentation column of the noisy spectrogram column.Similarly to the collection of soft segmentation columns as the soft segmentation image, or simply asthe soft segmentation. The band-dependent posterior probability that the hard segmentation columnv...

متن کامل

A Tree-Based Approach to Integrated Action Localization, Recognition and Segmentation

A tree-based approach to integrated action segmentation, localization and recognition is proposed. An action is represented as a sequence of joint hog-flow descriptors extracted independently from each frame. During training, a set of action prototypes is first learned based on a k-means clustering, and then a binary tree model is constructed from the set of action prototypes based on hierarchi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013